AITopics | randomized tree

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.04)
North America > United States > District of Columbia > Washington (0.04)
North America > United States > California > Monterey County > Monterey (0.04)
North America > Canada (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

Neural Information Processing SystemsFeb-7-2026, 17:54:50 GMT

1cfa81af29c6f2d8cacb44921722e753-Paper.pdf

Then, wederivealocalMDI importance measure of variable relevance, which has a very natural connection withtheglobal MDImeasure andcanberelated toanewnotion oflocalfeature relevance. We further link local MDI importances with Shapley values and discuss them in the light of related measures from the literature.

artificial intelligence, imp, machine learning, (18 more...)

Country: Europe > Belgium (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.70)

Neural Information Processing SystemsNov-20-2025, 14:59:18 GMT

204da255aea2cd4a75ace6018fad6b4d-Paper.pdf

estimator, local average estimator, random forest, (15 more...)

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.04)
North America > United States > District of Columbia > Washington (0.04)
North America > United States > California > Monterey County > Monterey (0.04)
(3 more...)

Genre: Research Report > New Finding (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Neural Information Processing SystemsOct-2-2025, 17:21:38 GMT

From global to local MDI variable importances for random forests and when they are Shapley values

Then, we derive a local MDI importance measure of variable relevance, which has a very natural connection with the global MDI measure and can be related to a new notion of local feature relevance. We further link local MDI importances with Shapley values and discuss them in the light of related measures from the literature.

artificial intelligence, machine learning, shapley value, (18 more...)

Country:

Europe > Belgium > Wallonia > Liège Province > Liège (0.04)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.85)

Neural Information Processing SystemsSep-30-2025, 12:58:55 GMT

Understanding variable importances in forests of randomized trees

Despite growing interest and practical use in various scientific areas, variable importances derived from tree-based ensemble methods are not well understood from a theoretical point of view. In this work we characterize the Mean Decrease Impurity (MDI) variable importances as measured by an ensemble of totally randomized trees in asymptotic sample and ensemble size conditions. We derive a three-level decomposition of the information jointly provided by all input variables about the output in terms of i) the MDI importance of each input variable, ii) the degree of interaction of a given input variable with the other input variables, iii) the different interaction terms of a given degree. We then show that this MDI importance of a variable is equal to zero if and only if the variable is irrelevant and that the MDI importance of a relevant variable is invariant with respect to the removal or the addition of irrelevant variables. We illustrate these properties on a simple example and discuss how they may change in the case of non-totally randomized trees such as Random Forests and Extra-Trees.

input variable, randomized tree, variable importance, (3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligenceAug-24-2022, 13:31:26 GMT

Mango: a new way to make Bayesian optimisation in Python

Now, let's dive into Mango! In recent years, the amount of data has grown considerably. This represents a challenge for data scientists who need their machine learning pipelines to be scalable. Distributed computing might solve this issue. Distributed computing refers to a set of computers that work on a common task while communicating with each other.

bayesian optimization, hyperparameter, mango, (16 more...)

#artificialintelligence

Country: North America > United States > California (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.71)

Kim, Jungtaek, Choi, Seungjin

On Uncertainty Estimation by Tree-based Surrogate Models in Sequential Model-based Optimization

arXiv.org Machine LearningFeb-21-2022

Sequential model-based optimization sequentially selects a candidate point by constructing a surrogate model with the history of evaluations, to solve a black-box optimization problem. Gaussian process (GP) regression is a popular choice as a surrogate model, because of its capability of calculating prediction uncertainty analytically. On the other hand, an ensemble of randomized trees is another option and has practical merits over GPs due to its scalability and easiness of handling continuous/discrete mixed variables. In this paper we revisit various ensembles of randomized trees to investigate their behavior in the perspective of prediction uncertainty estimation. Then, we propose a new way of constructing an ensemble of randomized trees, referred to as BwO forest, where bagging with oversampling is employed to construct bootstrapped samples that are used to build randomized trees with random splitting. Experimental results demonstrate the validity and good performance of BwO forest over existing tree-based models in various circumstances.

bwo forest, mondrian forest, surrogate model, (12 more...)

2202.10669

Country:

North America > Canada > Quebec > Montreal (0.04)
North America > United States > Nevada (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
(10 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)
(2 more...)

Sutera, Antonio, Louppe, Gilles, Huynh-Thu, Van Anh, Wehenkel, Louis, Geurts, Pierre

From global to local MDI variable importances for random forests and when they are Shapley values

arXiv.org Machine LearningNov-3-2021

Random forests have been widely used for their ability to provide so-called importance measures, which give insight at a global (per dataset) level on the relevance of input variables to predict a certain output. On the other hand, methods based on Shapley values have been introduced to refine the analysis of feature relevance in tree-based models to a local (per instance) level. In this context, we first show that the global Mean Decrease of Impurity (MDI) variable importance scores correspond to Shapley values under some conditions. Then, we derive a local MDI importance measure of variable relevance, which has a very natural connection with the global MDI measure and can be related to a new notion of local feature relevance. We further link local MDI importances with Shapley values and discuss them in the light of related measures from the literature. The measures are illustrated through experiments on several classification and regression problems.

artificial intelligence, machine learning, shapley value, (17 more...)

2111.02218

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
Europe > Belgium > Wallonia > Liège Province > Liège (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

arXiv.org Machine LearningSep-29-2020

Selective Cascade of Residual ExtraTrees

Liu, Qimin, Liu, Fang

We propose a novel tree-based ensemble method named Selective Cascade of Residual ExtraTrees (SCORE). SCORE draws inspiration from representation learning, incorporates regularized regression with variable selection features, and utilizes boosting to improve prediction and reduce generalization errors. We also develop a variable importance measure to increase the explainability of SCORE. Our computer experiments show that SCORE provides comparable or superior performance in prediction against ExtraTrees, random forest, gradient boosting machine, and neural networks; and the proposed variable importance measure for SCORE is comparable to studied benchmark methods. Finally, the predictive performance of SCORE remains stable across hyper-parameter values, suggesting potential robustness to hyperparameter specification.

artificial intelligence, extratree, machine learning, (16 more...)

2009.14138

Country:

North America > United States > Tennessee > Davidson County > Nashville (0.04)
North America > United States > New York (0.04)
North America > United States > Indiana > St. Joseph County > Notre Dame (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.87)

Konstantinov, Andrei V., Utkin, Lev V.

Gradient boosting machine with partially randomized decision trees

arXiv.org Machine LearningJun-19-2020

The gradient boosting machine is a powerful ensemble-based machine learning method for solving regression problems. However, one of the difficulties of its using is a possible discontinuity of the regression function, which arises when regions of training data are not densely covered by training points. In order to overcome this difficulty and to reduce the computational complexity of the gradient boosting machine, we propose to apply the partially randomized trees which can be regarded as a special case of the extremely randomized trees applied to the gradient boosting. The gradient boosting machine with the partially randomized trees is illustrated by means of many numerical examples using synthetic and real data.

artificial intelligence, machine learning, randomized tree, (18 more...)

2006.11014

Country:

Asia > Russia (0.14)
North America > United States > California (0.05)
Oceania > Australia > Victoria > Melbourne (0.04)
(4 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.67)